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Weak KAM Pairs and Monge-Kantorovich Duality 
Patrick Bernard and Boris Buffoni 

Abstract. 

The dynamics of globaUy minimizing orbits of Lagrangian sys- 
tems can be studied using the Barrier function, as Mather first did, 
or using the pairs of weak KAM solutions introduced by Fathi. The 
central observation of the present paper is that Fathi weak KAM 
pairs are precisely the admissible pairs for the Kantorovich problem 
dual to the Monge transportation problem with the Barrier function 
as cost. We exploit this observation to recover several relations be- 
tween the Barrier functions and the set of weak KAM pairs in an 
axiomatic and elementary way. 



§1. Introduction 

Let M be a compact connected manifold and consider a La- 
grangian function 

L : TM X M ^ R 
that satisfies the standard hypotheses of the calculus of variations, 
(LI) L{x,v,t+l) ^ L{x,v,t) on TM x R, 

(L2) dl^L{x,v,t) > on TM x R, 

(L3) lim L{x,v,t)/\\v\\ ^ oo on A/ x M. 

\\v\\ — >oo 

It is standard that, under these assumptions, there exists a well-defined 
time-periodic continuous vectorfield E{x, v, t) on TM such that the in- 
tegral curves of E satisfy the Euler-Lagrange equations associate to L. 
We assume in addition that this vectorfield generates a complete flow, 
and denote by ip the time-one flow, which is a diffeomorphism of TM. 

In this paper we show that the theory developed by Mather [IT], 
Mafie FT^ and Fathi [TU] amounts for a large part to the analysis of the 
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function A : M x M ^M. defined by the expression 

A{x, y) = min / L{-i{t), ^{t), t)dt, 
1 Jo 

where the minimum is taken on the set of curves 7 : [0, 1] — ^ M 
which satisfy 7(0) — x and 7(f) = y. 

To emphasize this point of view, we develop an abstract theory based 
solely on an arbitrary continuous function A : M x M M., where M is 
a connected compact metric space. We then define Ai ^ A and 

An{x,y) = min A{x, zi) + A{zi, Z2) + ■ ■ ■ + A{zn-i,y) 

Zl ,...,Zn-\^M 

for all integers n ^ 2. It turns out that the family (v4„) is equicontinu- 
ous and our only hypothesis on A is that the family (A„) is uniformly 
bounded (this can be achieved by adding some constant to A). It then 
follows that the expression 

c{x,y) = liminf yl„(a;,2;) 

n — >QO 

defines a continuous function c : M x M ^ M. 

We call (00 1 01 ) an admissible Kantorovich pair for c if 

\fy e M 01 (y) = min 0o(a;) + c{x, y) 

and 

Vx G Af 0o(a;) = max0i(y) — c(a;, y). 

The first main result (Theorem [12]) states that {(po, 0i) is an admissible 
Kantorovich pair for c if and only if 

• (j)o{x) = mayiy^M 4>o{y) - M^y v) for aU x € M, 

• 01 (x) = min^^gAf 01 (y) + A{y,x) for aU x G M, 

• and 00 (x) = 01 (cc) whenever c{x, a;) = 0. 

The second main result (Theorem I13|) concerns the minimization 
problem 

min / A{x,y)d7j{x,y), 

J MxM 

where the minimum is taken on the set of Borel probability measures 
rj on M X M with equal marginal measures, that is, 7rojj(?7) = 7rijj(?7) 
with ttq and tti denoting the canonical projections on M. Among all 
admissible measures, the minimizing ones are shown to be exactly those 
supported on the set 

D^{{x,y)e M X M I A{x, y) + c(y, x) = 0}. 
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This is also restated in the following way in Theorem 1151 Let 
X = be endowed with the product topology and denote by Mt{X) 
the set of Borelian probability measures on X which are invariant by 
translation. Consider the minimization problem 

min / A{xQ,Xi)di'{x)^ 
v^Mt(x) Jx 

where a generic x ^ X is written x = (■ ■ • , X-i, xq, 2^1, . . .)■ Then we 
show with the help of the Ergodic Decomposition Theorem that v in 
M.t{X) is minimizing exactly when the push-forward of v by the pro- 
jection X [xq^xi) is concentrated on D. 

The paper ends with the interpretation of these abstract theorems 
in the setting of the Aubry-Mather theory, recovering in this way some 
key results of [H [Til [10] . 

§2. Monge-Kantorovich theory 

We present some standard facts of Monge-Kantorovich theory, first 
in the general case, and then when the cost satisfies some given assump- 
tions. 

2.1. Generalities 

We recall the basics of Monge-Kantorovich duality. The proofs are 
available in many texts on the subjects, for example [U [151 [SI- We 
assume that M and N are compact metric spaces, and that c{x, y) is a 
continuous cost function on M x N . Given Borel probability measures 
/io on M and /ii on N , a transport plan between /xo and /ii is a measure 
on M X N which satisfies 

7rott(?/) = Mo and 7ri|j(77) = /zi, 

where ttq : M x iV — > M is the projection on the first factor, and 
TTi : Af X iV is the projection on the second factor. We denote by 

/C(/.to, /ii), after Kantorovich, the set of transport plans. Kantorovich 
proved the existence of a minimum in the expression 

(1) C(/xo,/ii)= min / cd-q 

for each pair (/io,/^i) of probability measures. The plans which realize 
this minimum are called optimal transfer plans. Let 0o be a real func- 
tion on M and (jji a real function on N. The pair (^Oj^i) is called an 
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admissible Kantorovich pair if it satisfies the relations 

01 (y) = min (i)o{x) + c{x,y) and (j)o{x) = max 01 (y) - c{x,y) 

for all point x G M and y E N. Another discovery of Kantorovich is 
that 



where the maximum is taken on the non-empty set of admissible Kan- 
torovich pairs {(j)Q, (f>i). This maximization problem is called the dual 
Kantorovich problem, the admissible pairs which reach this maximum 
are called optimal Kantorovich pairs. The direct problem ^ and dual 
problem ([2]) are related as follows. 

Proposition 1. If rj is an optimal transfer plan, and «/ (0o,0i) is 
a Kantorovich optimal pair, then the support of rj is contained in the set 

{(x, y) € M X N such that 4>i{y) — (f>o{x) — c(x, y)}, 

which is a closed subset of M x N because 0o o,iT-d 4>i are continuous. 

Let us remark that the knowledge of the set of Kantorovich admis- 
sible pairs is equivalent to the knowledge of the cost function c. 

Lemma 2. We have 



where the maximum is taken on the set of Kantorovich admissible pairs. 

Proof. This Lemma is elementary and can be proved by easy ma- 
nipulation of inequalities, see [1]. However, we present a short proof 
based on the non-elementary Monge-Kantorovich duality. Let us fix 
points x & M and y G N, and let be the Dirac measure at x and 
Hi be the Dirac measure at y. There exists one and only one trans- 
port plan between /io and /ii, it is the Dirac measure at {x,y). As a 
consequence, we have c{x,y) = C(/io,Mi)- Hence the equality above is 
precisely the conclusion of Kantorovich duality for the transportation 
problem between /ig and /j,i. q 

Proposition 3. Let (0o,0i) be an admissible pair, and let fio be a 
probability measure on M . Then there exists a probability measure jii on 
N such that the pair (00, 0i) is optimal for the transportation problem 
of the measure fiQ onto the measure fii 




c{x,y) 



max 01 (y) - 0o(a;) 

(00, 0l) 
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Proof. If /io is the Dirac at x, then take a point y such that 

4>i{y) = 4>o{x) + c(x, y), and observe that the conclusion obviously holds 
if jjLi is the Dirac at y. The set of measures /xq for which the conclusion 
holds (given (po,<Pi) is clearly convex and closed (with respect to the 
weak topology), it contains the Dirac measures, hence it is the whole set 
of probability measures. q 

2.2. Distance-like costs 

Kantorovich stated his duality theorem first in the case where M = 
N and the cost is a distance. Then, the dual problem takes a simpler 
form that we now describe. In fact, it is not necessary to assume that 
the cost is a distance. It is sufficient to assume that, for all x, y and z 
in M, we have 



A function (/) ; M — > M is called c-Lipschitz if it satisfies the inequality 

Hy) - Hx) < c{x, y) 

for all X and y in M. Note that, in the above and in what follows, we 
assume that M = iV is a compact and connected metric space, and that 
c:MxM—>M is a continuous cost function. 

Theorem 4. Assume that the cost c S C(M^,M) satisfies the as- 
sumptions (CI) and (C2). Then for each pair ^q, fi\ of probability 

measures on M, we have 



where the maximum is taken on the set of c-Lipschitz functions <j). 

This is a well-known direct rewriting of Kantorovich duality in view 
of the following description of admissible pairs. 

Lemma 5. If the cost saiisfies (CI) and (C2), then the Kantorovich 
admissible pairs are precisely the pairs of the form {4>,4>), with 4> c- 
Lipschitz. 

Proof. If (/) is a c-Lipschitz function, then {(f), (f)) is an admissi- 
ble pair. Indeed, let us prove for example that 4'{x) = miiiy 4){y) + 
c{y,x). On the one hand, we have <p{x) ^ (j){y) + c{y,x) because 



(CI) 



c{x,z) < c{x,y) -\-c{y,z) 



(C2) 



c{x, x) — 0. 
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is c-Lipschitz, hence (j){x) ^ mirij, ^(y) + c{y,x). On the other hand, 
4>{x) = 4>{x) + c{x,x) ^ m.vay(j){y) + c(jj,x). One can prove similarly 
that (t>{x) = maxy (j){y) — c{x,y). It follows that {4>,(f') is an admissible 
pair. Conversely, if {(f>Oi(t>i) is an admissible pair, then 0o = 0i is a 
c-Lipschitz function. This is a special case of Lemma [5] below. |— | 

Let us now study costs which satisfy (CI) but not necessarily (C2). 
It is then useful to define the set 

A:= {x e M, c{x, x) = 0} C M. 

Note that the restriction of the cost c to ^ x ^ obviously satisfies (CI) 
and (C2). In this more general case, we have: 

Lemma 6. Let c G C(A/^,M) satisfy (CI). Let (0o,0i) be an ad- 
missible pair. Then the functions (j)Q and (pi are c-Lipschitz. In addition, 
we have (j)Q ^ with equality on A. 

Proof. Let us first prove that the function 01 is c-Lipschitz. Given 
X £ M, there exists y such that 0i(a;) = <poiy) + c(y,x), and then, for 
each z, 

(t)i{x) = (f>Q{y) + c{y, x) ^ 4>i{z) - c{y, z) ^ c{y, x) ^ 0i(z) - c(x, z). 

One can prove similarly that 0o is c-Lipschitz. 
We then have 

4>q(x) — max 01 (y) — c(a:, y) ^ max0i(a;) — (j)i{x). 

V V 

because 4>i is c-Lipschitz. If a; £ we have, in addition, 

(l)o{x) = max(/)i(y) - c{x,y) ^ (/>i(a;) - c{x,x) = 0i(a;). 
y 

□ 

We now introduce another hypothesis which is certainly less natural 
than (CI) and (C2), but is useful for the applications we have in mind. 
We assume that 

(C3) ^7^0 and c(x,ij) ~ mmc(x,a) -\- c(a,y) 

aeA 

for each x and y in M. Note that, under (CI), (C3) is implied by (C2). 
The hypothesis (C3) implies that each optimal transport can be factored 
through the set A. 
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Lemma 7. // the cost satisfies (CI) and (C3), then for each pair 
(/io, of probability measures, there exists a probability measure fi sup- 
ported on A and such that 

C(^o, Ml) ^C{^io,fi) + C{fi, fii) 

Proof. First note that C{^o, Mi) ^ ^(^o, Mi) is true for all 

Borelian probability measures on M . This can be seen as follows. Let 
% and 771 be optimal transport plans for (^0, m) and ^i) respectively. 
Disintegrate ryo with respect to tti and 771 with respect to ttq: 7?o = 
jj^i Vozdniz) and r/i — Jj^j rji^d^^z) (see e.g. Theorem 5.3.1 in [2] for the 
disintegration theorem; here r^oz and 771^ are seen as probability measures 
on M). Following Section 5.3 in define the probability measure rj on 
M2 by 

r^{AxB)= ( rja,{A)7ji,iB) d^iiz) 
Jm 

for all Borelian subsets A,B C M. Then 77 e /C(mo, Mi) and 



cdrj^ c{x,y)driozix)drii;:{y)d^{z) 

M2 JM3 



^ / {c{x,z) + c{z,y)}dT]o;,{x)dr]i2{y)d^i{z) = / cdr]o+ cdrji. 
JM^ Jap Jm2 

Let us now prove the reverse inequality when /io and /ii are Dirac 
measures supported in x and y. In this case, one can take for /i the 
Dirac measure supported at a, where a is any point such that c{x,y) = 
c{x, a) + c{a, y). The general case is then deduced once again using the 
fact that, on M^, the set of probability measures is the closed convex 
envelop of the set of Dirac measures, so that we can approximate any 
optimal transfer plan in /C(mo; Mi) by Dirac measures. q 



Proposition 8. // the cost c e C{AP,R) satisfies (CI) and (C3), 
then for each admissible pair (00, 4>i), there exists a function : ^ — > M, 
which is c-Lipschitz, and such that 

01(a) = 00 (a) = 0(a) 

for all a a A, 

(3) 01 (a:;) = min 0(a) + c(a, x) 

for all X £ M and 



(4) 



00 (a;) — max 0(a) — c{x,a). 

a£A 
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Conversely, given any c-Lipschitz function (j) on A, the functions 0o o^c^ 
(/>! defined by ^ and (0) form an admissible pair. In other words, there 
is a bisection between the set of admissible pairs and the set of c-Lipschitz 
functions on A. 

Proof. The fact that 0o and <j)i are c-Lipschitz and that, on A, 
4>Q = (pi :— (t> results from Lemma [HI Let us prove (U), the proof of ([3]) 
being similar: 

(C3) 

(j)f){x) ^ m&yi<j)i{y) ~ c{x,y) = max 4ii{y) ~ c{x,a) - c{a,y) 

y yeALaeA 

= max(/)o(a) — c{x, a) — max0(a) — c{x, a). 

aeA aeA 

Conversely, let (f) he a, c-Lipschitz function on A, and let 00 and (/>! 
be defined by (|4]) and ([3]). The reader will easily check that (po and 
are c-Lipschitz, and that 0i ^ ^ 0o on A. We now prove that (/)o ^ (f>i 
(and then that there is equality on A): 

(po (x) — (f>i (x) = max (/>(a) — c(x, a) — (pib) — c(b, x) 

a,b(EA 

^ max (pia) — <pib) — c(6, a) ^ 

a,b^A 

because (p is c-Lipschitz on A. In order to check that the pair ((po,(j)i) 
is an admissible pair, we shall prove that 

(po (x) = max (pi (y) - c{x,y) 
y 

and leave the other half to the reader. For each x in M, we have 
(po(x) = max(/)(a) — c(x, a) = max0i(a) — c(x, a) ^ max0i(y) — c(x, y). 

a^A a£A y£M 

In order to obtain the other inequality, let us prove that 

(pi{y) - <Po{x) < c{x,y) 
for all X an y in AI. Indeed, we have 

01 (y) - (po{x) = min cp{a) + c{a,y) - (p{b) + c{x,b) 

a,beA 

^ min c{b, a) + c(a, y) + c(x, b) = min c{x, a) + c(a, y) = c(x, y) 

a.beA aeA 

by (C3). □ 
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Since 4>i and (f>2 are c-Lipschitz (Lemma H]) , equations and ^ 
imply 

(l)i{x) = mm (f>i{y) + c{y,x) and 0o(a;) = max0o(?/) - c(a;, y) 
for all X <E M. 

§3. Abstract Mather-Fathi Theory 

In this section, we consider a continuous function A{x, y) : M x M — > 
M. Recall that A/ is a compact connected metric space. We shall build 
several functions out of A. First, we define the sequence of functions 
An{xTy) by setting Ai = A and 

An ix,y) ^ mm A{x, z) + An-i[z, y) 

= min A{x,zi) + A{zi,Z2) + ■ ■ ■ + A(zn-i,y). 

zi,...,z„_i6M 

Lemma 9. The functions An are equicontinuous. In addition, there 
exists a real number I and a positive constant C such that 

\An{x,y) - /n| < C 

for all n £ N and all x and y in M . 

Proof. The function A is continuous, hence uniformly continuous, 
hence there exists a modulus of continuity S : [0, oo) [0, oo) such that 
lime^o S{e) = (5(0) = and such that 

\Aix, y) - A{X, Y)\ < 5{d{x, X)) + 5[d{y, Y)) 

for all x, y. X. Y in M. Clearly, for all n ^ 2 and all zi, . . . , z„_i e M, 
the function (x, y) A{x, zi) + A{zi, Z2) + . . . + A{zn-i,y) is uniformly 
continuous, with the same modulus of continuity as A. Hence An is 
uniformly continuous with the same modulus of continuity as A because 
it is the infimum of functions having all the same modulus of continuity. 

Let us define the sequences M„ := max(j. y)^^^ An{x,y) and to„ := 
mm (^r^ y-^^j^[2 An{x,y). It is clear that the sequence M„ is subadditive, 
i. e. that Mn+k ^ Afn + Mk for all n and k in N. In order to check this 
claim, we take x and y in M such that An+k{x, y) = Mn+k- Then there 
exists a point z in M such that 

Mn+k = An+k{x, y) = An{x, z) + Ak{z , y) < Mn + Mk- 
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Similarly, the sequence m„ is super-additive, i. e. TOn+fc ^ "nin + f^k- 
On the other hand, on view of the equicontinuity of An, there exists 
a constant C such that M„ — m„ ^ C. Applying a standard result on 
subadditive sequences (see e.g. Lemma 1.18 in [5]), we obtain that Mn/n 
converges to its infimum and that m„/n converges to its supremum 
m. Then for each x and y, 

nM - C ^ Mn-C ^rrin^ An{x, y) ^ A/„ ^ m„ + C ^ nm + C 

which implies that M = m and proves the Lemma. q 

We make, on the function A, the hypothesis 

(Al) I = 0. 

Note that this hypothesis implies that A{x, x) ^ for all x, and more 
generally that An{x, x) ^ for all x. Then, we can define a cost function 
c by the expression 



(5) c{x, y) =\imM An{x,y). 

n — ^oo 

In view of Lemma IHl the function c takes finite values and is contin- 
uous. We have c{x, x) ^ and, by Lemma [Til below. c{x, y) + c{y, x) ^ 
c(x, x) ^ for all x and y in M . 

Lemma 10. For each n G N, we have 

c(x, y) = min cix, z) + A„(z, y) ~ min An{x, z) + c(z, y). 
zeM zeM 

Proof. Let us fix n. Passing at the liminf (m oo) in the inequal- 
ity 

Am+n{x, y) < Ayn{x, z) + A„(z, y), 

we obtain 

c{x, y) < c{x, z) + An(z, y). 

For the opposite inequality, let us notice that, for each m, there exists a 
point Zm in M such that 

Am+n{x,y) = A„i{x, Zm) + A„(z„j,2/). 

Let us consider an increasing sequence of integers mk such that the 
subsequence Zm^ has a limit z and Vmik^oo A„i^+n{x,y) — c{x,y). At 
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the liminf, we get, taking advantage of the equicontinuity of the functions 
c{x, y) > c(x, z) + An{z, y). 

This proves that 

c(x, y) = minc(x, z) + A„{z, y). 

z 

The proof of the second equahty of the statement is similar. |— | 

Lemma 11. The cost function c satisfies (CI) and (C3). 

Proof. The triangle inequality is easily deduced from Lemma 1101 
Let us now prove (C3). We first prove that, given x and y in M, there 
exists a point z in M such that c{x,y) = c{x,z) + c{z,y). Indeed, 
for each n in N, there exists a point z„ such that c(x,y) = c{x,Zn) + 
An[zn,y)- Considering an increasing sequence of integers Uk such that 
the subsequence Zn^. has a limit z, we obtain at the liminf along this 
subsequence that c(x, y) ^ c{x, z) + c(z, y) which is then an equality. 

By recurrence, there exists a sequence Z„ G M such that, for each 
fc £ N, we have 

c(x, y) = c(x, Zi) + c{Zi, Z2) + ... + c{Zk-i, Zk) + c{Zk, y). 

Note that c(Zi, Z^+i) ~ c{Zi, Z^+i) if ^ ^ < m ^ fc, where 

Za = X and Z^+i = y. 

Let Z be an accumulation point of the sequence Z„. For each e > 0, 
we can suppose, by taking a subsequence in Z„, that all the points Z„ 
belong to the ball of radius e centered at Z. We conclude that, for each 

fc eN, 

c(x, y) ^ c(x, Z) + {k- l)c{Z, Z) + c(Z, y) - 2(fc + l)J(e). 

This is possible only if c{Z,Z) ^ 2(5(e), and since this should hold for 
all e we conclude that c{Z, Z) ^ 0, hence c(Z, Z) = 0. We have proved 
the existence of a point Z E A such that c{x, y) = c{x, Z) + c{Z, y). |— | 

Let us define, the two operators on the space C(M, R) of con- 
tinuous functions on M by the expressions 

T^uix) — min u(y) + A(y, x) 

and 

T'^u{x) — maxu(y) — A{x, y). 
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We have the following relation between the fixed points of these 
operators and the admissible pairs of the Kantorovich dual problem 
with cost c. Recall the definition A :— {x ^ M, c{x, a;) = 0} C M. 

Theorem 12. Let A be a function satisfying (Al), and let c be the 
cost defined by The pair {4)q, (j){) of functions on M is a Kantorovich 
admissible pair (for c) if and only if 

• the function 0o a fixed point of , 

• the function (pi is a fixed point of T~ , 

• (po — (pi on A. 

Finally, for each fixed point (pi of T~ , there exists one and only one 
function (pQ such that {(po,(j)i) is an admissible pair. 

Proof. Let {(po,(j)i) be an admissible pair. Then we have the ex- 
pression 

(pi{y) = min0o(a;) +c{x,y). 

We obtain that 

T~(pi{z)^ min 4)o{x) + c{x,y) + A{y, z) 

x,y(^M 

= min (Pq{x) + c{x, z) = (pi{z). 

x^M 

We prove in the same way that the function (po is a fixed point of T^. 
Lemma IHl implies that (pQ = (pi on A. 

Conversely, let {(po, (j)i) satisfy the three conditions of the statement. 
We first observe that the functions (po and (pi are c-Lipschitz. Indeed, 
we have, for each n, 

(l^iiy) - (pt{x) ^ An{x,y). 

When n = 1, this is a direct consequence of the fact that (pi is a fixed 
point of , and the general case is proved by induction. We get 

<Piiy) - 4>i{x) < liminf A„(a;,y) = c{x,y). 

n — *oo 

The function (pi being a fixed point of T~ , for each n £ N, there exists 
a point yn in M such that (pi{x) = 0i(j/„) + A„(y„, x). Indeed, we can 
find successively yi,y2, ■ ■ ■ such that 



(pi{x) = (pi{yi) + A{yi,x) ^ 01(2/2) + A{V2,yi) + A{yi,x) 

= ■ ■ ■ = 01 (yn) + A{yn,yn-i) + ■ ■ . + A{yi,x). 
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By definition of An, we get 4>i{x) ^ (t>i{yn) + An{yn,x). The reverse 
inequality has just been proved above. 

Let rifc be a subsequence such that ?/„^ has a hmit y. At the hmit, 
we obtain the inequafity 

01 (a;) ^ (f)i{y) + c{y,x), 

which is then an equahty. We have proved that 

(j)i{x) = min (f>i{y) + c{y,x). 

yeM 

Let us call cj) the common value of (f)o and 4>i on A. In view of (C3), we 
have 

01 (x) = min tpi (y) + c{y, a) + c(a, x) 

yeM.aeA 

= min 01 (a) + c(a, a;) = min 0(a) + c(a, x). 

aeA aeA 

One can prove in a similar way that 

00 (x) = max 00 (a) — c{x, a) = max 0(a) — c{x, a). 

a£A a£A 

We conclude that (0o,0i) is an admissible pair by Proposition [8] This 
also proves the uniqueness claim. 

In order to prove the last part of the statement, let us consider a 
fixed point 0i of T~ . Let us define the function 0o by 

0o(a;) = max 01 (a) — c(x,a). 

aeA 

Since the function 0i is c-Lipschitz (as seen above), we have 0o ^01- 
On the other hand, it is clear that 0i ^ 0o on A. As a consequence, we 
have 00 = 0i on A. By Lemma [TOl we have for all z S M that 

max 4io{x) — A{z, x) — max 0i(a) — c(x, a) — yl(z, x) 

x£M xeM,aeA 

= max 01 (a) — c(z, a) = (f>o{z). 

aEA 

Hence the function 0o is a fixed point of r+ and, as a consequence, the 
pair (00, 0i) is an admissible pair. r-i 
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§4. Dynamics 

Let us define the subset 

D := {(x, y) e M x M s. t. A{x, y) + c{y, x) ^ 0} (Z A x A 

(see Lemma [TU)) . We shall explam in two different ways that the Borel 
probability measures rj on M x M which are supported on D and satisfy 
'''otj(^) — ''^uiv) can be seen in a natural way as the analog of Mather 
minimizing measures in our setting. 

4.1. Construction via Kantorovich pairs 

We first expose a construction based on Kantorovich pairs. 

Theorem 13. Under the assumption {Al), we have 

min / A{x,y)dT]{x,y) = 0, 
^ Jmxm 

where the minimum is taken on the set of Borel probability measures 
rj on M X M such that 7ro(|(??) = 7riu(7y). The minimizing measures are 
those which are supported on D. 

Proof. Let us first prove that there exists a measure 77 on M x M 
which is supported on D and such that ttojj (ry) = ttij (77) . By Lemma [TUl 
for each S A^ there exists a point xi in A such that (a;o,a;i) € D. 
Hence there exists a sequence xq, 2:2, . . . x„, . . . of points of A such 
that {xmXn+i) G D for each n. Let us now consider the sequence 

5(xa,xi) + 5{xi,X2) H 1- 5{x„_i,x„) 

rjn = 

n 

of probability measures on A x A. Every accumulation point (for the 
weak topology) of the sequence rjn satisfies the desired property. Since 
the set of probability measures on Af x Af is compact for the weak 
topology, such accumulation points exist. 

Consider a measure rj on M x M which is supported on D and such 
that 7rou(»y) — 7ri[t(?7). We have 

j ^(a;, y)drj{x, y) = j -c{y, x)drj{y, x) ^ J <j){y) - (t){x)drj{x, y) = 0, 

where <j) is any c-Lipschitz function. 

On the other hand, let 77 be a probability measure on M x Af such 
that 7roj(77) = 'Ki^{rj). Consider a function which is A-Lipschitz. Such 
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functions exist, for example, take zi —>■ c(zi,Z2) for any zi G M (see 
Lemma [TUl) or fixed points of or r+ . We have 

(6) '^^ J '^^y'l ~ 't)ix)drj{x,y) J A{x,y)dT]{x,y). 

We have proved that the minimum in the statement is indeed zero, and 
that the measures supported on D are minimizing. There remains to 
prove that every minimizing measure is supported on D. 

It is clear that a measure 77 is minimizing if and only if, for each 
^-Lipschitz function (p, there is equality in which means that the 
measure rj is supported on the set 

Di = {{x,y)eM^\ 

(t>{y) — 4>{x) = A{x,y) for all A-Lipschitz functions 0}. 

Let I? 00 be the set of pairs (xq, xi) such that there exists a sequence xi, 
i G Z satisfying {xi,Xi-^-i) G Di for all z G Z (and of course with the 
given points xq and xi). 

We claim that Doo C D. In order to prove this claim, let 4> be A- 
Lipschitz. Observe that (ft is A„-Lipschitz for all n G N and c-Lipschitz. 
If (xqjXi) is a point in I? 00, then there exists a sequence Xi^ i such 
that 

0(xj) - (j){xi) = Aj-i{xi, Xj) 

for each i < j in Z. If a is an accumulation point of the sequence Xi at 
—00, we get the equality 

4>{xj) — 0(a) = c(a, Xj) 

for each j G Z and then, in the same way, c(a, a) = 0(a) — 0(a) = 0, 
hence a G .A. Let (00 , 0i ) be a Kantorovich pair for c, so that both 0o 
and 01 are A-Lipschitz (see Theorem [T^ . We get 0i(a) = 0o(a) (be- 
cause a G ^, see Theorem [T2t hence 0i(a;j) = (j)o{xj). Since this holds 
for all Kantorovich pairs, we get that xj G A (see Lemma [5]). In other 
words, we have proved that Doo C ^ x ^. Now let {xq, xi) he a. point 
of Doo- We have 2:1 G A, and, since the function c{xi, .) is A-Lipschitz, 
we have the equality c{xi,xi) — c{xi,xq) = A{xo,xi). Recalling that 
c{xi,xi) — 0, we get c{xi,xo) + A{xq,xi) = 0, hence (a:o, xi) G D. The 
proof of the Theorem then follows from the next Lemma. |— | 

Lemma 14. If rj is a probability measure on M x M which is sup- 
ported on Di and such that Tro^ii]) — 7rij(ry), then rj is concentrated on 
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Proof. Let us set fi = 7ro(|(?7) = T^iiiv) and let 

Xi =7ro(r>i)n7ri(£'i) cM 

be the set of points xq G M such that a sequence X-i,xo,xi exists, 
with (x_i,xo) G Di and (xo,a;i) S Di. Clearly, we have /z(7ro(-Di)) = 
/ti(7ri(-Di)) = 1 hence /ti(Xi) = 1. Let 

Da = Di n (Xi X Xi) 

be the set of pairs (xo,a;i) £ Af^ such that there exist a;_i, a;o, a;i, a;2 
with {xi,Xi+i) G Di for i = —1,0,1. Let 

X2 = 7ro(I>2) n 7ri(£)2) C M 

be the set of points xo £ M such that a sequence x_2, Xq, a;i, 0:2 
exists, with S £>! for all —2 ^ z ^ 1. Since = 1, we 

have r]{D2) = 1, hence /i(X2) = 1. By recurrence, we build a sequences 
D„ c M X M and X„ c M such that 

£>„ = £>i n (X„_i X X„_i) 

and 

X„ =7ro(Z?„)n^i(A0 C Af. 

By recurrence, we see that r]{Dn) = 1 and that /u(X„) 
have 

D^=f]Dn 

hence ri{D^) = 1. 

4.2. Ergodic Construction 

It is worth explaining that the preceding construction could have 
been performed in a quite different way, which does not use our theory 
of Kantorovich pairs, but relies on Ergodic theory, as the first papers of 
Mather. 

Consider X = endowed with the product topology, so that X 
is a metrizable compact space. We shall denote by Mt{X) the set of 
Borelian probability measures on X which are invariant by translation. 
More precisely, we denote hy T : X X the translation map 

T(. . . , a_2, a-i, ao, ai, 02, ...) = (... , 6_2, 60, &i, 62, • • •) 

with bi = fli+i for all i G Z, so that A4t{X) is the set of probability 
measures u on X such that Tf^i/ = u. 



= 1. Now we 



□ 
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Theorem 15. We have 



mm / 



A{xq,xi) dv{x) = 0, 



The measure v is minimizing if and only if its marginal ?/ = (ttq x tti)^v 
is concentrated on D. 

Note that Theorem [12] is equivalent to Theorem [T^] in view of the 
foUowing: 

Lemma 16. Let rj be a Borelian probability measure on such 
that 7rott(??) = 7ri|j(?7). Then there exists a Borelian measure v on X that 
is T -invariant and such that rj is its push-forward by the map X 9 a; — » 

Proof. This follows from the Hahn-Kolmogorov extension theorem 
(see e.g. Theorem 0.1.5 in [12], Lemma 10.2.4 in [7] and Theorem 12.1.2 
in [7]). Let SI be the algebra of finite unions of subsets G of X of the 
type G — Ili^zGi where Gi ^ M for at most a finite number of indices 
i (the number depending on G) and every Gi is a Borelian subset of 
M. We first define the T-invariant probability measure v on f2 and 
then apply the Hahn-Kolmogorov extension theorem, which provides an 
unique extension to the Borel cr-algebra (by uniqueness, the extension 
is T-invariant). 

Let ry = r]xj^d^{xi) be the disintegration of rj with respect to the 
projection 3 {xo,xi) ^ G M. In particular /i = 7rij(?7) (see e.g. 
Theorem 5.3.1 in ^ for the disintegration theorem). Define for m < n 



v{. . . y. M X M X G,n x . . . x G„ x M x Af x . . .) 




)d^l{Xn)- 



This is well defined because if Gi 



m— 1 



= M then 




(^m) • ■ ■ dfjx^i^Xji—i 



)dfl{Xn) 



)dfl{Xn) 
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and if Gn+i ~ M then 



L 



G„x...xG„xG„+i 

dr]x„,+Axm) ■ ■ ■d?]x^{xn-i) > d?7a:„+i(a;„)d^(a;„+i) 

G„xJ\/ I JG™x...xG„_i J 

) ■ ■ ■ dri^^{xn-i) > dr]{Xn,Xn+l) 
G„xM I JG,„x...xG„_i I 



dr]x^^iiXra) ■ ■ ■ dri^^{xn^i) ) d^{Xn) 
G„ I JG„x...xG„_i J 

drix^+i {xm) ■ ■ ■ drix„ (a;„_i)d/i(a:„) 

GmX...xG„ 

because ^ = 7i'i|j(?7) 7roj(77). 

Clearly = 1 and v is T-invariant on 17. |— | 

Although we have proved the equivalence between Theorem [T3] and 
Theorem [T3l we shall, as announced, detail another proof of Theorem [TSl 
For X € X and every Borelian subset B, we define 

tb{x) = lim - card{0 j < n - 1 1 r^(a;) £ B) 

(when the notation is used, it is understood that the limit exists). A 
Borelian probability v on X is ergodic if and only if, for every Borelian 
subset B C X, there holds tb{x) = I'iB) z^-almost surely. 

Following Section II. 6 in the book by Mafie [T2|, there exists a Borel 
set T, C X such that = 1 for each i/ £ AiriX), and, for each a; G E, 
the measure 

^ n — 1 

-.^ lim -^Stj^x] 

is well defined and ergodic, where the limit is understood in the sense 
of the weak topology, that is 

r 1 

(7) V/eC(X,R) / fd,yx= lim -V/(r^(x)). 

•'^ 3=0 

Moreover Vx £ AiriX) and x belongs to the support of Vx for all x € 12. 
In addition, still following T2], we have that the function x i — *■ J fdux 
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is z/-integrable and T-invariant, and that 

(8) J^(^J^fd,y,^diy = J^fdi^. 

holds for every f £ {X, v) . Note that the measure Vx is the conditional 
probability measure of v with respect to the cr-algebra of T-invariant 
Borel sets. 

We define the continuous function F : X — > M by T{x) = A{xo,xi). 
By standard convexity arguments, the following minimum is reached: 

a = min / r(x)diy(x). 
ueMT{x)Jx 

Let us prove that a < 0. Fix xq e M. For all e > 0, we can find 
n > 1 and xi,...,Xn & M such that 



^ n—l 

Xq and — ^{xj, Xj+i) < e 



3=0 



(thanks to assumption (Al)). Let x = {. . . ,xo, . . . ,Xn, ■ ■ ■) G X have 
periodic components with period n and define v G A4t{X) by 

^ n — 1 
j=0 

where 6t3(x) is the Dirac measure at T^{x). Then J^Tdu < e, which 
proves that a < (because e can be chosen arbitrarily small). 
Let u s M.t{X) be any optimal measure. The equality 

J (^j Tdv^dv = J Tdiy = a 

shows that ^du^ = a for z/-almost all x gT,. For such a x, we get 

^ n—l ^n— 1 

(9) 0>a= lim -^r(r^"(x))= lim - ^ A(a;,-, 

n— »+oo n n— »+oo n '-^ 



J=0 j=0 



Assume for a while that xq ^ A. Then there exists a neighborhood U 
of Xq in M, ^ > and N ^ \ such that 

(10) An{yQ, zo) > <5 > for all yo, zqGU and n'^ N 
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(we use here the equicontinuity of the functions An). Setting U ~ {y € 
X\yoe [/}, we get 

< Vx{U) ^ liniinf - card{0 ^ j ^ n - 1 | a;,- e U). 

The first inequahty is a consequence of the fact that x is in the sup- 
port of Ux and the second one follows from ([7]) and the fact that the 
characteristic function of U is the supremum of an increasing sequence 
of continuous functions. We denote by (a;^^ : k ^ Q) the sequence of 
components of a; in t/ (of non-negative index). We obtain (see UHl)) 

< VxiU) ^ limmf 

and the contradiction 

jmN-i m-iicfc+ijN-i 
liminf > A(a;,-, = liminf > N A{xj,Xj+i) 

'm — 1 



77i(5 — 

^ liminf ^ Vx{U)5lN > 

(compare with This contradiction shows that xq G A for j/-almost 
all X, that is, the marginal fi — ■kq^v is concentrated on A. 

Let us now check that a ^ Q. For contradiction, suppose a < 0. 
Then for a; G S as above such that € AAt{X) and V{vx) — a, we get 

I 





> a = r(j/j;) = lim -"S^ A{xj,x,+i) 

n— *4-oo 77, ^- — ^ 

= lim — — A(a;j,a;j+i) + ^(a;„,a;o) 
^ lim sup — —An+i(xQ,xo). 

n— + 00 n + 1 

This contradicts / = (see hypothesis (Al)). 

We have proved that a = 0, and that every minimizing T-invariant 
measure v has its marginal ^ — vroji^ concentrated on A. Let us now 
prove that every minimizing measure v e Mt{X) is supported on {a: S 
X \ A[xa,xi) + c(xi,a:o) = 0}. Let x belong to the support of v and 
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observe that (see Lemma [TU|) c{xi,yi) ^ c{xi,yo) + A{yo,yi) for all 
2/0:2/1 G M. Therefore 

= / c{xi,yi) - c{xi,yo)diy{y) / A{yo,yi) dv{y) = a = 
Jx Jx 

and 

/ Myo, yi) - c{xi,yi) + c{xi,yo) dv{y) = 
Jx 

where the integrand is non negative. Hence c{xi,yi) = c{xi,yo) + 
A{yQ,yi) for I'-almost all y. Since x is in the support of j^, we get 
c{xi,xi) = c{xi,xo) + A(xo,xi). We have just seen that yg G A for u- 
almost all y. By the T-invariance of v, we also have yi ^ A for i/-almost 
all y. Since x is in the support of we therefore obtain xi £ A and 
= 0(2:1, xi) = c{xi,Xo) +A{xq,xi). 

Finally let ly G AiriX) be concentrated on 

D = {yeX\ A{yo, yi) + c(?;i, yo) = 0} 

and let us prove that Tdv — a. By ([5]) applied to the characteristic 
function of D, we get that Vx{D) = 1 for i^-almost all a; G S. By (|8]) 
applied to F, we see that it suffices to check that Tdv^ — ol for all 
X G E such that Vx is concentrated on D. This follows from (CI): 



= a< / A{yQ,yi)dvx{y) ^ - I c{yi,yo) dv^iy) 
Jx Jx 

ri-l 



1 1 

= — lim — y c(x,+i,2:j) ^ — liminf — c(a:„, Xq) = 0. 

□ 



§5. Aubry-Mather theory 

We now briefly explain the relations between our discussions and 
the literature on Aubry-Mather theory, and especially [11] , [M] and [10] . 
From now on, the space M is a compact connected manifold and we con- 
sider a Lagrangian function L : TM x M ^ R as in the Introduction. 
In this context, we define A : M x M ^ R by 



A{x,y)^mm [ L{j{t),j{t),t)dt, 
T Jo 
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where the minimum is taken on the set of curves 7 : [0, 1] — > M 
which satisfy 7(0) = x and 7(1) = y. 

The function c defined by ^ is one of the central objects of Mather's 
theory of globally minimizing orbits, see [11]. He called it the Peierls 
barrier. It contains most of the information concerning the globally 
minimizing orbits, as was explained by Mather, see also [3]- The set A 
of points X € M such that c(x,x) = is called the projected Aubry 
set. It is especially important because Mather proved the existence of 
a vectorfleld X{x) on A whose graph is invariant under the Lagrangian 
flow If. This invariant set is called the Aubry set. The analog of the 
Aubry set in our general theory is the set D defined in the beginning of 
section H) 

The operators have been introduced by Albert Fathi in this 
context, see [H],[S] and [TD]. He called Weak KAM solutions the fixed 
points of T~ , and we call backward weak KAM solutions the fixed points 
of r+. He also noticed that, for each weak KAM solution (pi, there exists 
one and only one backward weak KAM solution 0o which is equal to 
on the projected Aubry set. This is the main part of our Theorem 
[T^ Albert Fathi also proved Lemma [5] in this context. Our novelty in 
these matters consists of pointing out and using the equivalence with 
Kantorovich admissible pairs, which allows, for example, a strikingly 
simple proof of the important result of Fathi called Lemma [5] in our 
paper. The representation of weak KAM solutions given in Proposition 
|5]was obtained by Contreras in .6j. 

The minimizing measures of Theorem [T3] are the famous Mather 
measures, see |11| . To be more precise, we should say that there is a 
natural bijection between the set of minimizing measures in Theorem 
[island the set of Mather measures. This bijection is described in 4j. 
In order to give the reader a clue of this bijection, let us recall that 
the Mather measures are probability measures on the tangent bundle 
TM, and that the minimizing measures of Theorem 1131 are probability 
measures on M x M. Denoting by ip the time-one Lagrangian flow, 
and by tt : TM — > M the standard projection, we have a well-defined 
mapping (tt, tt o (p)^ from the set of probability measures on TM to the 
set of probability measures on M x M. This mapping induces a bijection 
between the set of Mather measures on TM and the set of minimizing 
measures of Theorem 1131 

The part of Theorem [T3| stating that the minimizing measures are 
precisely the measures supported on D is the analogous in our setting of 
the theorem of Mane stating that all invariant measures supported on 
the Aubry set are minimizing, see [13j . 
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